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Abstract 

We study lower bounds for Locality Sensitive Hashing (LSH) in the strongest setting: point 
sets in {0, l} d under the Hamming distance. Recall that Ti is said to be an (r, cr, p, g)-sensitivc 
hash family if all pairs x,y £ {0, l} d with dist(x, y) < r have probability at least p of collision 
under a randomly chosen h £ H, whereas all pairs x,y £ {0,l} d with dist(x,y) > cr have 
probability at most q of collision. Typically, one considers d — > oo, with c > 1 fixed and q 
bounded away from 0. 

For its applications to approximate nearest neighbor search in high dimensions, the quality 
of an LSH family Ti is governed by how small its "rho parameter" p = ln(l/p)/ ln(l/g) is 
as a function of the parameter c. The seminal paper of Indyk and Motwani showed that for 
each c > 1, the extremely simple family TL — {x t— > Xj : i £ d} achieves p < 1/c. The 
only known lower bound, due to Motwani, Naor, and Panigrahy, is that p must be at least 
(e 1 ^ - \)/{e l / c + 1) > .46/c (minus o d (l)). 

In this paper we show an optimal lower bound: p must be at least 1/c (minus 0^(1)). This 
lower bound for Hamming space yields a lower bound of 1/c 2 for Euclidean space (or the unit 
sphere) and 1/c for the Jaccard distance on sets; both of these match known upper bounds. Our 
proof is simple; the essence is that the noise stability of a boolean function at e - ' is a log-convex 
function of t. 

Like the Motwani-Naor-Panigrahy lower bound, our proof relies on the assumption that 
q is not "tiny", meaning of the form 2~ e< - ci '. Some lower bound on q is always necessary, as 
otherwise it is trivial to achieve p = 0. The range of q for which our lower bound holds is the 
same as the range of q for which p accurately reflects an LSH family's quality. Still, we conclude 
by discussing why it would be more satisfying to find LSH lower bounds that hold for tiny q. 
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1 Locality Sensitive Hashing 



Locality Sensitive Hashing (LSH) is a widely-used algorithmic tool which brings the clas- 
sic technique of hashing to geometric settings. It was introduced for general metric spaces 
in the seminal work of Indyk and Motwani [IM98 . Indyk and Motwani showed that the 
important problem of (approximate) nearest neighbor search can be reduced to the prob- 
lem of devising good LSH families. Subsequently, numerous papers demonstrating the prac- 
tical utility of solving high-dimensional nearest neighbor search problems via the LSH ap- 
proach |GIM99I IBuhOll ICDF+Oll ISVD03I IRPH051 IDDCxR07] . For a survey on LSH, see Andoni 
and Indyk [AI08] . 

We recall the basic definition from [IM98 : 

Definition 1. Let (X, dist) be a distance spac^\, and let U be any finite or countably infinite set. 
Let r > 0, c > 1. A probability distribution TL over functions h : X — > U is (r, cr,p, (^-sensitive 
if for all x, y <E X, 

dist(x,y) < r => Pr [h{x) = h(y)] > p, 
dist(x,y) > cr Pr [h(x) — h(y)] < q, 

where q < p. We often refer to TL as a locally sensitive hash (LSH) family for (X, dist). 

As mentioned, the most useful application of LSH is to the approximate near neighbor problem 
in high dimensions: 

Definition 2. For a set of n points P in a metric space (X, dist), the (r, c)-near neighbor 
problem is to process the points into a data structure that supports the following type of query: 
given a point x G X, if there exists y G P with dist(x,y) < r, the data structure should return 
a point z G P such that dist(a;, z) < cr. 

Several important problems in computational geometry reduce to the approximate near 
neighbor problem, including approximate versions of nearest neighbor, furthest neighbor, close 
pair, minimum spanning tree, and facility location. For a short survey of these topics, see In- 
dyk |Ind04j . 

Regarding the reduction from (r, c)-near neighbor problem to LSH, it is usual (see |Ind01| 
IDIIM04j ) to credit roughly the following theorem to [IM98| IGIM99] : 

Theorem 1.1. Suppose Ti. is an (r, cr,p, q)-sensitive family for the metric space (X, dist). Then 
one can solve the (r,c)-near neighbor problem with a (randomized) data structure that uses 
0{n 1+p + dn) space and has query time dominated by 0(n p log 1 / (J (n)) hash function evaluations. 
(The preprocessing time is not much more than the space bound.) 

Here we are using the following: 

Definition 3. The rho parameter of an (r, cr, p, q)-sensitive LSH family TL is 

Please note that in Theorem 11.11 it is implicitly assumed |Ind09j that q is bounded away 
from 0. For "subconstant" values of q, the theorem does not hold. This point is discussed 
further in Section 2J 

Because of Theorem [TU there has been significant interest |DIIM041 ITT071 IAT081 |NeylO| 
in determining the smallest possible p that can be obtained for a given metric space and value 
of c. Constant factors are important here, especially for the most natural regime of c close to 1. 
For example, shrinking p by an additive .5 leads to time and space savings of 0(y / n). 



A metric space where the triangle inequality need not hold. 
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2 Previous work 



2.1 Upper bounds 

The original work of Indyk and Motwani [IM98j contains the following simple yet strong result: 

Theorem 2.1. There is an LSH family 7i for {0, l} d under the Hamming distance which for 
each c > 1 has rho parameter 

p(n) < -, 

c 

simultaneously for each r < d/c. 

In this theorem, the family is simply the uniform distribution over the d functions hi{x) = Xi. 
For a given c and r, this family is obviously (r, cr, 1 — r/d, 1 — cr /eQ-sensitive, whence 

, . ln(l/(l-r/d)) „1 

We remark that the upper bound of 1/c in Theorem 12.11 becomes tight only for asymptotically 
small r/d. Indyk and Motwani showed that the same bound holds for the closely related "Jac- 
card metric" (see [IM98] ) . and also extended Thcorcm l2.1l to an LSH family for the metric space 
ti (see also [AT06] h 

Perhaps the most natural setting is when the metric space is the usual d-dimensional Eu- 
clidean space Here, Andoni and Indyk [AI08] showed, roughly speaking, that p < 1/c 2 : 

Theorem 2.2. For any r > 0, c > 1, d > 1, there is a sequence of LSH families Tit for £ d 
satisfying 

limsupp(7Y t ) < — . 

(The complexity of evaluating a hash function h ~ TL t also increases as t increases.) 

For other i s distance/metric spaces, Datar, Immorlica, Indyk, and Mirrokni [DIIM04 have 
similarly shown @ 

Theorem 2.3. For any r > 0, c > 1, d > 1, and < s < 2, there is a sequence of LSH families 
Tit for if satisfying 

lira sup p(Ht) < max < — , - 

t^oo C C 

Other practical LSH families have been suggested for the Euclidean sphere [TT07] and £2 |NeylO| . 

2.2 Lower bounds 

There is one known result on lower bounds for LSH, due to Motwani, Naor, and Panigrahy [M NP07j : 

Theorem 2.4. Fix c > 1, < q < 1, and consider d — > 00. Then there exists some r — r(d) 
such that for any LSH family TL for {0, l} d under Hamming distance which is (r 7 cr,p,q)-sensitive 
must satisfy 

rn-t\ ^ ex P(V c ) ~ 1 /-1 \ 
P[H) > exp(l/c) + 1 - 0d(1) ' 

The metric setting of {0, l} d under Hamming distance is the most powerful setting for lower 
bounds; as Motwani, Naor, and Panigrahy note, one can immediately deduce a lower bound of 

expq/c 8 ) - 1 
exp(l/c s ) + 1 ° d{ ' 



2 Please note that in Pan06, MNP07 it is stated that [DIIM04] also improves the Indyk-Motwani 1/c upper bound 
for l\ when c < 10. However this is in error. 
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for the setting of £ d . This is simply because ||x — y|| s = \x — i when x,y € {0, l} d . 

As c — * co, the lower bound in Theorem 12.41 approaches This is a factor of 2 away from 
the upper bound of Indyk and Motwani. The gap is slightly larger in the more natural regime 
of c close to 1; here one only has that piTi.) > f^j-j ~ ■ 

Note that in Theorem 12.41 the parameter q is fixed before one lets d tend to oo; i.e., q is 
assumed to be at least a "constant" . Even though this is the same assumption implicitly made 
in the application of LSH to near- neighbors (Theorem II. lj) , we feel it is not completely satis- 
factory. In fact, as stated in [MNP07] . Theorem 12.41 still holds so long as q > 2~°( d \ Our new 
lower bound for LSH also holds for this range of q. But we believe the most satisfactory lower 
bound would hold even for "tiny" q, meaning q — 2~®( d \ This point is discussed further in 
Section |U 

We close by mentioning the recent work of Panigrahy, Talwar, and Wieder [PTW08 which 
obtains a time/space lower bound for the (r, c)-near neighbor problem itself in several metric 
space settings, including {0, l} d under Hamming distance, and £2- 



3 Our result 

In this work, we improve on Theorem 12.41 by obtaining a sharp lower bound of - — o,i(l) for 
every c > 1. This dependence on c is optimal, by the upper bound of Indyk and Motwani. The 
precise statement of our result is as follows: 

Theorem 3.1. Fix d € N, 1 < c < 00. and < q < 1. Then for a certain choice of < r < 1, 
any (rd, crd,p, q)-sensitive hash family Ti for {0, l} d under Hamming distance must satisfy 

Here, the precise meaning of the O(-) expression is 

ln(2/g) ( d \ 

where K is a universal constant, and we assume d/\n(2/q) > 2, say. 

As mentioned, the lower bound is only of the form - — 0^(1) under the assumption that 
q > 2~°( d \ For q of the form 2~ d / B for a large constant £?, the bound (JTJ) still gives some useful 
information. 



As with the Motwani-Naor-Panigrahy result, because our lower bound is for {0, l} d we may 
immediately conclude: 

Corollary 3.2. Theorem \3.1\ also holds for LSH families for the distance space £ s , < s < 00, 
with the lower bound l/c s replacing 1/c. 

This lower bound matches the known upper bounds for Euclidean space s = 2 ( |AI08j ) and 
< s < 1 ( DIIM04]). It seems reasonable to conjecture that it is also tight at least for 1 < s < 2. 

Finally, the lower bound in Theorem l3 . 1 1 also holds for the Jaccard distance on sets, matching 
the upper bound of Indyk and Motwani [IM98] . We explain why this is true in Section 13.21 
although we omit the very minor necessary changes to the proof details. 
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3.1 Noise stability 

Our proof of Theorem 13.11 requires some facts about boolean noise stability. We begin by 
recalling some basics of the analysis of boolean functions. 

Definition 4. For < p < 1, we say that (x,y) are p-correlated random strings in {0, l} d 
if x is chosen uniformly at random and y is formed by rerandomizing each coordinate of x 
independently with probability 1 — p. 

Definition 5. Given f : {0, l} d — > R, the noise stability of / at p is defined to be 

S/(p)= E [f(x)f{y)\. 

p-correlated 

We can extend the definition to functions f : {0, l} d — > R* 7 ria 

S/(p)= E [</(x), /(»)>], 

p-corre£a£ed 

where (to, z) = X^ec/ ^i- 2 * * s ^ e usual inner product^ 

Proposition 3.3. Let / : {0, l} d — » R* 7 and write /(S) /or ifte usual Fourier coefficient of f 
associated with S C [d]; i.e., 

/(^) = ^ E /wnHr 6RU 

a;e{0,l} d ieS 

Then 

Mp)= E II/iMp 1 * 1 - 

SC[d] 

(This formula is standard when / has range R; see, e.g., |O'D03j . The case when / has range 
M. u follows by repeating the standard proof.) 

We are particularly interested in hash functions h : {0, l} d — > U; we view these also as 
functions {0, l} d — ► R* 7 by identifying i E U with the vector ej G R* 7 , which has a 1 in the «th 
coordinate and a in all other coordinates. Under this identification, (h(x),h(y)) becomes the 
0-1 indicator of the event h(x) = h(y). Hence for a fixed hash function h, 

S h (p) = Pr [h{x) = h(y)\. (2) 

/^-correlated 

We also extend the notion of noise stability to hash families: 
Definition 6. If "H is a hash family on {0, l} d , we define 

Sn(p) = E i [S h (p)]. 

n~ rL 

By combining this definition with equation |2]) and Proposition 13. 3[ we immediately deduce: 
Proposition 3.4. Let H be a hash family on {0, 1} . Then 

§ h (p) = Pr [M*) = Mv)] - E „ E JMS)\\i]p lsl - 

(x,y) p-corr'd SC[d] 

Finally, it is sometimes more natural to express the parameter p as p — e~', where t € [0, oo). 
(For example, we can think of a p-correlated pair (x, y) by taking x to be uniformly random 
and y to be the string that results from running the standard continuous-time Markov Chain 
on {0, l} d , starting from x, for time td.) We make the following definition: 

Definition 7. Fort G [0,oo), we define K^(t) = §/j(e ), and we similarly define K^(t). 
3 In the case that U is countably infinite, we require our functions / to have ||/(a;)|]2 < oo for all x € {0, l} d . 
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3.2 The proof, modulo some tedious calculations 

We now present the essence of our proof of Theorem 13.11 It will be quite simple to see how it 
gives a lower bound of the form i — o,j(l) (assuming q is not tiny). Some very tedious calcula- 
tions (Chernoff bounds, elementary inequalities, etc.) are needed to get the precise statement 
given in Theorem 13. 11 the formal proof is therefore deferred to Section [5l 

Let Ti be a hash family on {0, l} d , and let us consider 

K n (t)= Pr [h(x)=h(y)}. (3) 

(x 7 y) e~ -corr'd 

Let us suppose that t is very small, in which case e _t f» 1 — t. When (a;, y) are (1 — i)-correlated 
strings, it means that y is formed from the random string x by rerandomizing each coordinate 
with probability t. This is the same as flipping each coordinate with probability t/2. Thus if we 
think of d as large, a simple Chernoff bound shows that the Hamming distance dist(a;, y) will 
be very close to (t/2)d with overwhelming probability]^ 

Suppose now that Tt is ((t/2)d + o(d), (ct/2)d — o(d),p, ^-sensitive, so the distance ratio is 
c — Od(l). In ([3]), regardless of h we will almost surely have dist(cc,y) < (t/2) + o(d); hence 
^n(t) > p — Od(l). Similarly, we deduce K-u(ct) < q + Od(l). Hence, neglecting the 0^(1) terms, 
we get 

<m = Hl/p) > ln (V K ^W) 

P[n > ln(l/«) ~ ha(l/K„(et))' 
We then deduce the desired lower bound of 1/c from the following theorem and its corollary: 
Theorem 3.5. For any hash family Tt on {0, l} d , the function K-^(t) is log-convex in t. 

Proof. From Proposition 13.41 we have 

K H (t)= V[\\h(S)\\l}e- tlsl . 

SC[d] 

Thus is log-convex, being a nonnegative linear combination of log-convex functions e - *' 5 ' . 

□ 

Corollary 3.6. For any hash family H on {0, l} d , t > 0, and c > 1, 

In(l/K w (t)) > 1 



ln(l/K w (ct)) ~ c 



Proof. By log-convexity, K H (t) < Kn^t) 1 ^ ■ K H (Qf- l / c = K^ct) 1 / . Here we used the fact 
that Kn(Q) = 1, which is immediate from the definitions because e _0 -correlated strings are 
always identical. The result follows. □ 

As mentioned, we give the careful proof keeping track of approximations in Section [5j But 
first, we note what we view as a shortcoming of the proof: after deducing Kn(ct) > gr — 0^(1), we 
wish to "neglect" the additive Od(l) term. This requires that 0^(1) indeed be negligible compared 
to q\ Being more careful, the Od(l) arises from a Chernoff bound applied to a Binomial(c?, ct) 
random variable, where t > is very small. So to be more precise, the error term is of the form 
exp(— ed), and hence is only negligible if q > 2~°( d \ 



4 Similarly, if we think of x and y as subsets of [d], their Jaccard distance will be very close to t/(l + t/2) w t 
with overwhelming probability. With this observation, one obtains our lower bound on LSH families for the Jaccard 
distance on sets. 
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4 Discussion 



4.1 On the reduction from LSH to near neighbor data structures 

As described in Section [TJ it is normally stated that the quality of an (r, cr, p, g)-sensitive LSH 
family TL is governed by p = \n(l/p)/ ln(l/q), and more specifically that TL can be used to solve 
the (r, c)-near neighbor problem with roughly 0(n 1+p ) space and query time 0(n p ). However, 
this involves the implicit assumption that q is bounded away from 0. 

It is easy to see that some lower bound on q is essential. Indeed, for any (finite, say) distance 
space (X, dist) there is a trivially "optimal" LSH family for any r and c: For each pair 
with dist(a;, y) < r, define h x>y by setting h x ^ y (x) = h x , y (y) = and letting h XtV (z) have distinct 
positive values for all z ^ x,y. If TL is the uniform distribution over all such h XyV , then p > 
and q = 0, leading to p(TL) = 0. 

To see why this trivial solution is not useful, and what lower bound on q is desirable, we recall 
some aspects of the Indyk-Motwani reduction from LSH families to (r, c)-near neighbor data 
structures. Suppose one wishes to build an (r, c)-near neighbor data structure for an n-point 
subset P of the metric space (X, dist). The first step in [IM98 is to apply the following: 

Powering Construction: Given an (r, cr,p, g)-sensitive family TL of functions X — > U and 
a positive integer k, we define the family TL® k by drawing h\, . . . , hj- independently from TL and 
forming the function h : X — > U k , h(x) = (hi(x), . . . , hk(x)). It is easy to check that TL® k is 
(r, cr,p k , g fe )-sensitive. 

Indyk and Motwani show that if one has an (r, cr, p' , q')-sensitive hash family with q' < 1/n, 
then one can obtain a (r, c)-near neighbor data structure with space roughly O(nfp') and query 
time roughly 0(l/p'). Thus given an arbitrary (r, cr,p, g)-sensitive family Ti, Indyk and Motwani 
suggest using the Powering Construction with k = log 1//(J (n) . The resulting TL ® fc is (r, cr, p' , 1 Jn)- 
sensitive, with p 1 — p k — n~ p , yielding an 0(n 1+p ) space, 0{n p ) time data structure. 

However this argument makes sense only if A; is a positive integer. For example, with the 
trivially "optimal" LSH family, we have q = and thus k — — oo. Indeed, whenever q < 1/n 
to begin with, one doesn't get 0(n 1+p ) space and 0(n p ) time, one simply gets 0{n/p) space 
and 0(1/ p) time. For example, a hypothetical LSH family with p = 1/n- 5 and q = 1/n 1 ' 5 has 
p = 1/3 but only yields an 0(n 15 ) space, 0(n 5 ) time near neighbor data structure. 

The assumption q > 1/n is still not enough for the deduction in Theorem 11.11 to hold pre- 
cisely. The reason is that the Indyk-Motwani choice of k may not be an integer. For example, 
suppose we design an (r, cr,p, g)-sensitive family TL with p = 1/n and q = 1/n . Then p = .5. 
However, we cannot actually get an 0(n 15 ) space, 0(n' 5 ) time data structure from this TL. The 
reason is that to get q k < 1/n, we need to take k = 4. Then p k — 1/w 6 , so we only get an 
0(n 1G ) space, 0(n 6 ) time data structure. 

The effect of rounding k up to the nearest integer is not completely eliminated unless one 
makes the assumption, implicit in Theorem II. li that q > 0(1). Under the weaker assumption 
that q > n^°^\ the conclusion of Theorem l 1 . 1 1 remains true up to n oty1 ^ factors. To be completely 
precise, one should assume q > 1/n and take k = [log 1 / (J (n)] . If we then use k < log 1 / g (n) + 1, 
the Powering Construction will yield an LSH family with q' < 1/n and p' = (n/q)~ p . In this 
way, one obtains a refinement of Theorem 11.11 with no additional assumptions: 

Theorem 4.1. Suppose TL is an (r, cr,p, q) -sensitive family for the metric space (X, dist). Then 
for n-point subsets of X (and assuming q > 1/n), one can solve the (r,c)-near neighbor problem 
with a (randomized) data structure that usesn-0((n/q) p +d) space and has query time dominated 
by 0((n/ q) p \og l j q (n)) hash function evaluations. 
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4.2 On assuming q is not tiny 



Let us return from the near-neighbor problem to the study of locality sensitive hashing itself. 
Because of the "trivial" LSH family, it is essential to impose some kind of lower bound on how 
small the parameter q is allowed to be. Motwani, Naor, and Panigrahy carry out their lower 
bound for LSH families on {0, l} d under the assumption that q > 0(1) , but also note that it 
goes through assuming q > 2-°W>. Our main result, Theorem 13. 11 is also best when q > 2 o( - d >, 
and is only nontrivial assuming q > 2~ d / B for a sufficiently large constant B. 

One may ask what the "correct" lower bound assumed on q should be. For the Indyk- 
Motwani application to (r, c)-near neighbor data structures, the answer seems obvious: "1/n". 
Indeed, since the Indyk-Motwani reduction immediately uses Powering to reduce the q param- 
eter down to the most meaningful LSH lower bounds would simply involve fixing q = 1/n 
and trying to lower bound p. 

There is an obvious catch here, though, which is that in the definition of LSH, there is 
no notion of "n"\ Still, in settings such as {0, l} d which have a notion of dimension, d, it 
seems reasonable to think that applications will have n = 2 e ( d \ In this case, to maintain the 
Indyk-Motwani Theorem [43] up to factors one would require q 

> 2-°( rf ). This is precisely 

the assumption that this paper and the Motwani-Naor-Panigrahy paper have made. Still, we 
believe that the most compelling kind of LSH lower bound for {0, l} d would be nontrivial even 
for q — 2~ d / b with a "medium" constant b, say b = 10. We currently do not have such a lower 
bound. 



5 Proof details 

We require the following lemma, whose proof follows easily from Proposition 13.41 and the defi- 
nition of hash family sensitivity: 

Lemma 5.1. Let Ti be an (r, cr, p, q)-sensitive hash family on {0, l} d and suppose (x,y) is a 
pair of e~ u -correlated random strings. Then 

p(l - Pr[dist(x, y) > r]) < K H {u) < q + Pr[dist(a;, y) < cr]. 

We now prove Theorem 13. 1[ which for convenience we slightly rephrase as follows: 

Theorem 5.2. Fix d G N, 1 < c < oo, and < q < 1. Then for a certain choice of < e < 1. 
any ((e/c)d, ed,p, q)-sensitive hash family for {0, l} d under Hamming distance must satisfy 

P = ^f\> l --K- K d^\ 
ln(l/g) c 



where K is a universal constant, 



X(d, q) = i^lJjj l n 



In(2/g) 



and we assume d/hi(2/q) > 2, say. 

Proof. Let < A = A(c, d, q) < .005 be a small quantity to be chosen later, and let e = .005A. 
Suppose that Ti is an ((e/c)d, ed,p, (^-sensitive hash family for {0, l} d . Our goal is to lower 
bound p = ln(l/p)/ln(l/g). By the Powering Construction we may assume that q < 1/e, 
and hence will use ln(l/g) > 1 without further comment. Define also t — 2e(l + A/2) and 
c! = c(l + A). 

Let be exp(— i/c')-correlated random strings and let (3:2,1/2) be exp(— t)-correlated 

random strings. Using the two bounds in Lemma 15. II separately, we have 



K w (t/e0>p(l-ei), K H (t)<q + e 2 , 
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where 

e% = Pr[dist(xi,yi) > (e/c)d\, e 2 = Pr[dist(x 2 , y 2 ) < erf]- 
By Corollary [3?6l we have 

1 ln(l/K„ft/cQ) In (g^) = ln(l/p) + ln(l/(l- ei )) 

c ' " ln(l/K w (*)) " ln(^) ln(l/g)+Ml/(l + e 2 /g))- U 

We will use the following estimates: 

1 1 1, A , 1 A 

7 = c(iTa)^ (1 - A) = "7' (5) 

ln(l/(l - ei)) < 1.01ei, (6) 
ln(l/«) + ln(l/(l + e 2 /q)) > m(l/g) - e./g = ln(l/g) (l - ) . (7) 

For ([6|) we made the following 

assumption: e\ < .01. (8) 
We will also ensure that the quantity in ([7]) is positive by making the following 

assumption: e 2 < qln(l/q). (9) 
Substituting the three estimates (©-([7]) into ([4]) we obtain 

1 _ A < ln(l/p) + l.Olei ln(l/p) + l.Olei ^^M/ e 2 

c c ~ ln(l/g)(l - g i n f 1/g) ) MVS) V c c / V Q-MVs) 

ln(l/p) 1 _ A e 2 1.01 ei 



ln(l/g) ~ c c qhx(l/q) ln(l/<?)' 
Thus we have established 

1 e e _ A + l.Olei + e 2 

— c c ln(l/g) gln(l/g) 

We now estimate e\ and e 2 in terms of A (and e), after which we will choose A so as to 
minimize e. By definition, e\ is the probability that a Binomial(d, 771) random variable exceeds 
(e/c)d, where rjx = (1 — exp(t/c'))/2. Let us select Si so that (1 + Si)r]i = e/c. Thus 

e 2e/c 2e/c 1 + A 

5i = 1 = '-. — -— - 1 > —r- - 1 = — :L r77r - 1 > .498A. 

an l-exp(-i/c') ~ t/c' l + A/2 

Here we used the definitions of t and c', and then the assumption A < .005. Using a standard 
Chernoff bound, we conclude 

pi = Pr[Bmomial(t2,77x) > (1 + Si)r)id\ < exp f~ 2 + a 1 7?ld ) < exp (^^8 ??ld ) ' ^ 
using the fact that S 2 /(2 + S) is increasing in S, and A < .005 again. We additionally estimate 

l-exp(i/c') t/c 1 - {t/c 1 ) 2 12 ,,,,,, „o /i t\ e/l + A/2\ e 

m = > ~ y - ' = {t/2c')~{t/2c'f > .99(t/2c') = .99- [-J^j > -98-. 

Here the second inequality used t/2c' < .01, which certainly holds since t/2c' < e = .005 A. The 
third inequality used A < .005. Substituting this into (jTTJ) we obtain our upper bound for e±, 

( A 2 e \ / .005A 3 \ / A 3 \ 

61 < 6XP I " 8Y5c d = 6XP "T257 d j < 6XP I " 2000^ J " (12) 
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Our estimation of e 2 is quite similar: 

e 2 = Pr[Binomial(d, r/ 2 ) < (1 - 5 2 )rj 2 d] < ex P (""y 7 ? 2 ^ ' ( 13 ) 

where T] 2 = (1 — exp(— 1))/2 and S 2 is chosen so that (1 — S 2 )i] 2 — e. This entails 

e 2e 2e 1 1 

So = 1 = 1 ; r > 1 7T7T = 1-^-, — 7Z — TTT = 1" 



rj 2 l-exp(-t)- t-t 2 /2 (t/2e)~e(t/2e) 2 (1 + A/2) - e(l + A/2) 2 

This expression is the reason we were forced to take e noticeably smaller than A. Using our 
specific setting e = .005A, we conclude 

So > 1 = 1 > 49A 

(1 + A/2)-e(l + A/2) 2 1 + .495A - .005A 2 - .00125A 3 " ' 

where we used A < .005 again. As for 772, we can lower bound it similarly to 771, obtaining 

772 > .99(4/2) = .99e(l + A/2) > .99e. 
Substituting our lower bounds for S 2 and 772 into (fT5|) yields 

e 2 < exp (- (- 49A ) . .gg e( /] < ex p f-^rrf) • (14) 



2000 

Plugging our upper bounds (TT21) . (fT4| for ei, e 2 into ([TO]) gives 

A l.Olexp(-^rf) exp(-^rf) 
' c In(l/g) + qHl/q) ' 1 J 

Finally, we would like to choose 

where A"i is an absolute constant. For K\ sufficiently large, this makes all three terms in the 
bound (fT5j) at most 

/ln(2/ g )\ 1/3 



2K 1 \(d,q) 1/3 = O 



\ d J 



This would establish the theorem. 

It only remains to check whether this is a valid choice for A. First, we note that with this 
choice, assumptions © and ([9]) follow from ([12]) and ([14]) (and increasing K\ if necessary). 
Second, we required that A < .005. This may not hold. However, if it fails then we have 

wj M/3 - 005 



We can then trivialize the theorem by taking K — (ifi/.005) 3 , making the claimed lower bound 
for p smaller than 1/c— 1/c 1 / 3 < 0. □ 
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